Discovering Mid-level Visual Connections in Space and Time
نویسندگان
چکیده
Finding recurring visual patterns in data underlies much of modern computer vision. The emerging subfield of visual category discovery/visual data mining proposes to cluster visual patterns that capture more complex appearance than low-level blobs, corners, or oriented bars, without requiring any semantic labels. In particular, mid-level visual elements have recently been proposed as a new type of visual primitive, and have been shown to be useful for various recognition tasks. The visual elements are discovered automatically from the data, and thus, have a flexible representation of being either a part, an object, a group of objects, etc. In this chapter, we explore what the mid-level visual representation brings to geo-spatial and longitudinal analyses. Specifically, we present a weakly-supervised visual data mining approach that discovers connections between recurring mid-level visual elements in historic (temporal) and geographic (spatial) image collections, and attempts to capture the underlying visual style. In contrast to existing discovery methods that mine for patterns that remain visually consistent throughout the dataset, the goal is to discover visual elements whose appearance changes due to change in time or location; i.e., exhibit consistent stylistic variations across the label space (date or geo-location). To discover these elements, we first identify groups of patches that are style-sensitive. We then incrementally build correspondences to find the same element across the entire dataset. Finally, we train style-aware regressors that model each element’s range of stylistic differences. We apply our approach to date and geo-location prediction and show substantial improvement over several baselines that do not model visual style. We also demonstrate the method’s effectiveness on the related task of fine-grained classification. Yong Jae Lee Dept. of Computer Science, UC Davis e-mail: [email protected] Alexei A. Efros Dept. of Electrical Eng. and Computer Science, UC Berkeley e-mail: [email protected] Martial Hebert Robotics Institute, Carnegie Mellon University e-mail: [email protected]
منابع مشابه
Mid-level Representation for Visual Recognition
Visual Recognition is one of the fundamental challenges in AI, where the goal is to understand the semantics of visual data. Employing mid-level representation, in particular, shifted the paradigm in visual recognition. The mid-level image/video representation involves discovering and training a set of mid-level visual patterns (e.g., parts and attributes) and represent a given image/video util...
متن کاملRecognition of Visual Events using Spatio-Temporal Information of the Video Signal
Recognition of visual events as a video analysis task has become popular in machine learning community. While the traditional approaches for detection of video events have been used for a long time, the recently evolved deep learning based methods have revolutionized this area. They have enabled event recognition systems to achieve detection rates which were not reachable by traditional approac...
متن کاملNonlinear inelastic dynamic analysis of space steel frames with semi-rigid connections in urban buildings
Applied studies addressing semi-rigid connections have been limited. Scant information exists in regulations except little brief information. Therefore, this research analyzes the behavior of three-dimensional steel frames and semi-rigid connections based on beam-column method and non-linear dynamic analysis. Stability functions and geometric stiffness matrix were used to study the non-linear g...
متن کاملIntroducing a Lightweight Structural Model via Simulation of Vernacular “Pa Tu Pa” Arch
The knowledge of Iranian vernacular structures is based on geometry, and there is a possibility of recreating such structural patterns aimed at producing movable structures. The purpose of this research was to utilize the patterns of vernacular structures to provide a lightweight structural model. The questions raised included how to create various forms based on the structural history of any r...
متن کاملAnalysis of the Interaction between the Formal Types of Traditional Houses and Spatial Configurations using Space Syntax Case Study: Traditional Houses of Kashan
Kashan is a city in Iran with a host of traditional houses that date to the Qajar period. Sociocultural and climatic factors have had the greatest impact on their spatial configurations. In this study, focusing on entrances, the spatial configuration of houses in Kashan is analyzed. Houses are first categorized in terms of physical form in four categories: single-courtyard houses with massing ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016